智能论文笔记

From global to local MDI variable importances for random forests and when they are Shapley values

Antonio Sutera , Gilles Louppe , Van Anh Huynh-Thu , Louis Wehenkel , Pierre Geurts

分类： (统计)机器学习 | 机器学习

2021-11-03

随机森林已被广泛用于其提供的所谓重要措施，在输入变量的相关性来预测某一输出全局（每个数据集）级洞察能力。在另一方面，根据沙普利值方法已被引入特征相关的基于树的模型分析细化到本地（每个实例）的水平。在这种情况下，我们首先证明杂质（MDI）变量重要性得分的全球平均减少对应的Shapley值在某些条件下。然后，我们推导出变量相关的本地MDI重要的措施，这与全球MDI衡量一个非常自然的连接，并且可以与局部特征相关的一个新概念。我们进一步联系当地MDI重要性有关与沙普利值和从文献中有关措施的光进行讨论。这些措施是通过实验在几个分类和回归问题的说明。

translated by 谷歌翻译

Soft Robotic Link with Controllable Transparency for Vision-based Tactile and Proximity Sensing

Quan Khanh Luu , Dinh Quang Nguyen , Nhan Huu Nguyen , Van Anh Ho

分类：机器人

2022-11-07

Robots have been brought to work close to humans in many scenarios. For coexistence and collaboration, robots should be safe and pleasant for humans to interact with. To this end, the robots could be both physically soft with multimodal sensing/perception, so that the robots could have better awareness of the surrounding environment, as well as to respond properly to humans' action/intention. This paper introduces a novel soft robotic link, named ProTac, that possesses multiple sensing modes: tactile and proximity sensing, based on computer vision and a functional material. These modalities come from a layered structure of a soft transparent silicon skin, a polymer dispersed liquid crystal (PDLC) film, and reflective markers. Here, the PDLC film can switch actively between the opaque and the transparent state, from which the tactile sensing and proximity sensing can be obtained by using cameras solely built inside the ProTac link. In this paper, inference algorithms for tactile proximity perception are introduced. Evaluation results of two sensing modalities demonstrated that, with a simple activation strategy, ProTac link could effectively perceive useful information from both approaching and in-contact obstacles. The proposed sensing device is expected to bring in ultimate solutions for design of robots with softness, whole-body and multimodal sensing, and safety control strategies.

translated by 谷歌翻译

Detecting Spam Reviews on Vietnamese E-commerce Websites

Co Van Dinh , Son T. Luu , Anh Gia-Tuan Nguyen

分类：自然语言处理 | 人工智能

2022-07-27

客户的评论在在线购物中起着至关重要的作用。人们经常参考以前客户的评论或评论，以决定是否购买新产品。赶上这种行为，有些人会为骗子的客户创建不真实的评论，以了解产品的假质量。这些评论称为垃圾邮件评论，它使消费者在在线购物平台上混淆，并对在线购物行为产生负面影响。我们提出了称为Vispamreviews的数据集，该数据集具有严格的注释程序，用于检测电子商务平台上的垃圾邮件评论。我们的数据集由两个任务组成：用于检测评论是否为垃圾邮件的二进制分类任务以及用于识别垃圾邮件类型的多类分类任务。Phobert在这两个任务上均以宏平均F1分别获得了最高的结果，分别为88.93％和72.17％。

translated by 谷歌翻译

XLMRQA: Open-Domain Question Answering on Vietnamese Wikipedia-based Textual Knowledge Source

Kiet Van Nguyen , Phong Nguyen-Thuan Do , Nhat Duy Nguyen , Tin Van Huynh , Anh Gia-Tuan Nguyen , Ngan Luu-Thuy Nguyen

分类：自然语言处理

2022-04-14

问题回答（QA）是信息检索和信息提取领域内的一项自然理解任务，由于基于机器阅读理解的模型的强劲发展，近年来，近年来，近年来的计算语言学和人工智能研究社区引起了很多关注。基于读者的质量检查系统是一种高级搜索引擎，可以使用机器阅读理解（MRC）技术在开放域或特定领域特定文本中找到正确的查询或问题的答案。 MRC和QA系统中的数据资源和机器学习方法的大多数进步尤其是在两种资源丰富的语言中显着开发的，例如英语和中文。像越南人这样的低资源语言见证了关于质量检查系统的稀缺研究。本文介绍了XLMRQA，这是第一个在基于Wikipedia的文本知识源（使用UIT-Viquad语料库）上使用基于变压器的读取器的越南质量检查系统，使用深层神经网络模型优于DRQA和BERTSERINI，优于两个可靠的QA系统分别为24.46％和6.28％。从三个系统获得的结果中，我们分析了问题类型对质量检查系统性能的影响。

translated by 谷歌翻译

Dynamic transformation of prior knowledge into Bayesian models for data streams

Tran Xuan Bach , Nguyen Duc Anh , Ngo Van Linh , Khoat Than

分类：机器学习 | (统计)机器学习

2020-03-13

我们考虑如何在从流环境中学习贝叶斯模型时有效地使用先验知识，其中数据无限依次出现。这个问题在数据爆炸时代非常重要，富有培训的模型，本体，维基百科等珍贵外部知识的富裕来源非常重要。我们表明一些现有的方法可以忘记任何知识。然后，我们提出了一种新颖的框架，使能够将不同形式的先验知识纳入基础贝叶斯模型的数据流。我们的框架载有一些现有的时序/动态数据的流行模型。广泛的实验表明，我们的框架优于具有大边距的现有方法。特别是，我们的框架可以帮助贝叶斯模型在极短的文本上概括，而其他方法过度装备。我们的框架的实施是在https://github.com/bachtranxuan/tps.git上获得的。

translated by 谷歌翻译

LoopDraw: a Loop-Based Autoregressive Model for Shape Synthesis and Editing

Nam Anh Dinh , Haochen Wang , Greg Shakhnarovich , Rana Hanocka

分类：计算机视觉

2022-12-09

There is no settled universal 3D representation for geometry with many alternatives such as point clouds, meshes, implicit functions, and voxels to name a few. In this work, we present a new, compelling alternative for representing shapes using a sequence of cross-sectional closed loops. The loops across all planes form an organizational hierarchy which we leverage for autoregressive shape synthesis and editing. Loops are a non-local description of the underlying shape, as simple loop manipulations (such as shifts) result in significant structural changes to the geometry. This is in contrast to manipulating local primitives such as points in a point cloud or a triangle in a triangle mesh. We further demonstrate that loops are intuitive and natural primitive for analyzing and editing shapes, both computationally and for users.

translated by 谷歌翻译

Multiple Perturbation Attack: Attack Pixelwise Under Different $\ell_p$-norms For Better Adversarial Performance

Ngoc N. Tran , Anh Tuan Bui , Dinh Phung , Trung Le

分类：计算机视觉 | 机器学习

2022-12-05

Adversarial machine learning has been both a major concern and a hot topic recently, especially with the ubiquitous use of deep neural networks in the current landscape. Adversarial attacks and defenses are usually likened to a cat-and-mouse game in which defenders and attackers evolve over the time. On one hand, the goal is to develop strong and robust deep networks that are resistant to malicious actors. On the other hand, in order to achieve that, we need to devise even stronger adversarial attacks to challenge these defense models. Most of existing attacks employs a single $\ell_p$ distance (commonly, $p\in\{1,2,\infty\}$) to define the concept of closeness and performs steepest gradient ascent w.r.t. this $p$-norm to update all pixels in an adversarial example in the same way. These $\ell_p$ attacks each has its own pros and cons; and there is no single attack that can successfully break through defense models that are robust against multiple $\ell_p$ norms simultaneously. Motivated by these observations, we come up with a natural approach: combining various $\ell_p$ gradient projections on a pixel level to achieve a joint adversarial perturbation. Specifically, we learn how to perturb each pixel to maximize the attack performance, while maintaining the overall visual imperceptibility of adversarial examples. Finally, through various experiments with standardized benchmarks, we show that our method outperforms most current strong attacks across state-of-the-art defense mechanisms, while retaining its ability to remain clean visually.

translated by 谷歌翻译

Improving Pareto Front Learning via Multi-Sample Hypernetworks

Long Phi Hoang , Dung Duy Le , Tuan Anh Tran , Thang Tran Ngoc

分类：机器学习

2022-12-02

Pareto Front Learning (PFL) was recently introduced as an effective approach to obtain a mapping function from a given trade-off vector to a solution on the Pareto front, which solves the multi-objective optimization (MOO) problem. Due to the inherent trade-off between conflicting objectives, PFL offers a flexible approach in many scenarios in which the decision makers can not specify the preference of one Pareto solution over another, and must switch between them depending on the situation. However, existing PFL methods ignore the relationship between the solutions during the optimization process, which hinders the quality of the obtained front. To overcome this issue, we propose a novel PFL framework namely \ourmodel, which employs a hypernetwork to generate multiple solutions from a set of diverse trade-off preferences and enhance the quality of the Pareto front by maximizing the Hypervolume indicator defined by these solutions. The experimental results on several MOO machine learning tasks show that the proposed framework significantly outperforms the baselines in producing the trade-off Pareto front.

translated by 谷歌翻译

QC-StyleGAN -- Quality Controllable Image Generation and Manipulation

Dat Viet Thanh Nguyen , Phong Tran The , Tan M. Dinh , Cuong Pham , Anh Tuan Tran

分类：计算机视觉 | 人工智能

2022-12-02

The introduction of high-quality image generation models, particularly the StyleGAN family, provides a powerful tool to synthesize and manipulate images. However, existing models are built upon high-quality (HQ) data as desired outputs, making them unfit for in-the-wild low-quality (LQ) images, which are common inputs for manipulation. In this work, we bridge this gap by proposing a novel GAN structure that allows for generating images with controllable quality. The network can synthesize various image degradation and restore the sharp image via a quality control code. Our proposed QC-StyleGAN can directly edit LQ images without altering their quality by applying GAN inversion and manipulation techniques. It also provides for free an image restoration solution that can handle various degradations, including noise, blur, compression artifacts, and their mixtures. Finally, we demonstrate numerous other applications such as image degradation synthesis, transfer, and interpolation.

translated by 谷歌翻译

Wavelet Diffusion Models are fast and scalable Image Generators

Hao Phung , Quan Dao , Anh Tran

分类：计算机视觉

2022-11-29

Diffusion models are rising as a powerful solution for high-fidelity image generation, which exceeds GANs in quality in many circumstances. However, their slow training and inference speed is a huge bottleneck, blocking them from being used in real-time applications. A recent DiffusionGAN method significantly decreases the models' running time by reducing the number of sampling steps from thousands to several, but their speeds still largely lag behind the GAN counterparts. This paper aims to reduce the speed gap by proposing a novel wavelet-based diffusion structure. We extract low-and-high frequency components from both image and feature levels via wavelet decomposition and adaptively handle these components for faster processing while maintaining good generation quality. Furthermore, we propose to use a reconstruction term, which effectively boosts the model training convergence. Experimental results on CelebA-HQ, CIFAR-10, LSUN-Church, and STL-10 datasets prove our solution is a stepping-stone to offering real-time and high-fidelity diffusion models. Our code and pre-trained checkpoints will be available at \url{https://github.com/VinAIResearch/WaveDiff.git}.

translated by 谷歌翻译